format optimised for telephone numbers or for integers is used.
...SPECIFICATION are simple digital data, it is not difficult to store all
the data.

However, the compression/decompression system in the GIF is based on
the LZW system of a dictionary having a maximum number of entries
of 4096. Further, among the 4096 entries, the number of empty entries or
the contents of the entries (second dictionary) are subjected to change
during the process of decompressing the same image. Furthermore, the
dictionary...



ABSTRACT EP 729237 A2

A system and method for compressing and decompressing data using a 
plurality of data compression mechanisms. Representative samples of each 
block of data are tested to select an appropriate one of the data 
compression mechanisms to apply to the block. The block is then 
compressed using the selected one of the mechanisms and the compressed 
block is provided with an identifier of the selected mechanism. For 
decompression, the identifier is examined to select an appropriate one 
of the data decompression mechanisms to apply to the block. The block is 
then decompressed using the selected one of the mechanisms. (see image 
in original document) 
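
The selection scheme in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the codec table, the one-byte identifier, and the use of the first `sample_size` bytes as the "representative sample" are all assumptions made for illustration, and only Python standard-library codecs are used.

```python
import bz2
import lzma
import zlib

# Hypothetical codec table: identifier -> (compress, decompress).
# Identifier 0 stores the block unchanged.
CODECS = {
    0: (lambda b: b, lambda b: b),
    1: (zlib.compress, zlib.decompress),
    2: (bz2.compress, bz2.decompress),
    3: (lzma.compress, lzma.decompress),
}

def compress_block(block: bytes, sample_size: int = 256) -> bytes:
    """Test a sample of the block, pick the best codec, tag the output."""
    sample = block[:sample_size]  # assumption: a prefix serves as the sample
    best_id = min(CODECS, key=lambda cid: len(CODECS[cid][0](sample)))
    # The first byte of the result is the identifier of the selected mechanism.
    return bytes([best_id]) + CODECS[best_id][0](block)

def decompress_block(tagged: bytes) -> bytes:
    """Read the identifier and apply the matching decompression mechanism."""
    codec_id, payload = tagged[0], tagged[1:]
    return CODECS[codec_id][1](payload)
```

Testing only a sample keeps the selection cheap, at the risk of picking a codec that is not the best one for the whole block.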
...SPECIFICATION be fixed to a predetermined set. For example, this set
could be initially empty, and dictionaries generated by a
dictionary-based compression method could be added to the set as
subsequent blocks are compressed, until a maximum number of dictionary
blocks is reached.

Furthermore, if there are currently no compressed blocks requiring a
given dictionary block for decompression (this can be determined by
examining the CMD fields), the given dictionary block can be removed
from the current set of dictionary blocks.

Among other possibilities, the...
ABSTRACT EP 678986 A1

A lossless type data compression method employing a dictionary system
is suitable for a character generator of a game machine and so forth.
Working data strings are generated from an original data stream. Two
sequential working data strings are combined to form a combined string. A
dictionary is generated by registering the combined strings having an
occurrence frequency higher than a given value with a dictionary number.
The combined strings in the data stream are replaced with the dictionary
numbers corresponding to the combined strings in the dictionary. (see
image in original document)
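
The pair-combining step described in this abstract can be sketched roughly as follows. This is an illustration under assumptions, not the patented method: the working strings are taken to be single tokens, the dictionary numbers are represented as hypothetical `("D", n)` tuples so that they cannot collide with string tokens, and only one combining pass is made.

```python
from collections import Counter

def build_pair_dictionary(tokens, min_count):
    """Register adjacent pairs occurring more than min_count times."""
    counts = Counter(zip(tokens, tokens[1:]))
    frequent = [pair for pair, n in counts.items() if n > min_count]
    # Each registered pair gets a dictionary number, wrapped as ("D", n).
    return {pair: ("D", number) for number, pair in enumerate(frequent)}

def encode(tokens, pair_dict):
    """Replace registered pairs in the stream with their dictionary numbers."""
    out, i = [], 0
    while i < len(tokens):
        pair = tuple(tokens[i:i + 2])
        if len(pair) == 2 and pair in pair_dict:
            out.append(pair_dict[pair])
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out

def decode(encoded, pair_dict):
    """Expand dictionary numbers back into the pairs they stand for."""
    reverse = {number: pair for pair, number in pair_dict.items()}
    out = []
    for token in encoded:
        if token in reverse:
            out.extend(reverse[token])
        else:
            out.append(token)
    return out
```

Because the dictionary is built before encoding, the decoder needs the same dictionary, which suits fixed data such as the character generator mentioned in the abstract.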
ABSTRACT EP 666651 A2

A class of lossless data compression algorithms use a memory-based
dictionary (312) of finite size to facilitate the compression and
decompression of data. To reduce the loss in data compression caused by
dictionary resets, a standby dictionary (328) is used to store a subset
of encoded data entries previously stored in a current dictionary. In a
second aspect of the invention, data is compressed/decompressed
according to the address location of data entries contained within a
dictionary built in a content addressable memory (CAM) (312). In a third
aspect of the invention, the minimum memory/high compression capacity
of the standby dictionary scheme is combined with the fast single-cycle
per character encoding/decoding capacity of the CAM circuit. In a fourth
aspect of the invention, a selective overwrite dictionary swapping
technique is used to allow all data entries to be used at all times for
encoding character strings (450-472). (see image in original document)
Significantly improves the performance of a processor executing the
well-known Ziv-Lempel (ZL) data compression/expansion algorithm by
providing a novel structure for ZL dictionaries, and a novel SZL (static
Ziv-Lempel) process for using an SZL dictionary(s) in a static manner to
compress and/or expand records randomly accessed from a data base without
spending processing on modifying the dictionary. An SZL dictionary is
generated by a pass over a data base (sampling some or all of the records
in the data base) before the dictionary is used for
compression/expansion. (This is unlike the adaptive Ziv-Lempel, AZL,
dictionary in the prior art which is generated while compressing data,
during which it is continuously being "adapted" (changed) by the data it
is compressing.) Entries in an SZL dictionary have novel internal
structures for reducing processing time, in which a single entry may
include extension characters, plural child characters or sibling
characters, along with information fields and unique control fields that
enable compression or expansion determinations to be made without
accessing other entries involved in the determinations, which enables the
SZL process to have fewer memory accesses per compression symbol for
faster compression and expansion operations. Compression/expansion
operations are also speeded up by accessing multiple characters per
operation, enabled by matching the size of each dictionary entry to the
data unit size accessed from the computer memory. The novel SZL process
operates most efficiently with a separate compression dictionary and a
separate expansion dictionary. However, a single SZL dictionary may be
constructed to provide both compression and expansion. (see image in
...SPECIFICATION are numbered consecutively after the features described
above. The further features include:

• 15. Providing separate compression and expansion dictionaries
instead of a single dictionary. This allows optimizing the contents
of each dictionary for its respective purpose, thus achieving the best
possible performance during each of compression and expansion. The
compression dictionary can contain more PECs than in the single
dictionary case. The compression dictionary can contain more CCs than
in the single dictionary case. Since SDs are still available...
ABSTRACT EP 573208 A1

A class of lossless data compression algorithms use a memory-based
dictionary of finite size to facilitate the compression and decompression
of data. To reduce the loss in data compression caused by dictionary
resets, a standby dictionary is used to store a subset of encoded data
entries previously stored in a current dictionary. In a second aspect
data is compressed/decompressed according to the address location of data
entries contained within a dictionary built in a content addressable
memory (CAM). In a third aspect the minimum memory/high compression
capacity of the standby dictionary scheme is combined with the fast
single-cycle per character encoding/decoding capacity of the CAM circuit.
The circuit uses multiple dictionaries within the storage locations of a
CAM to reduce the amount of memory required to provide a high data
compression ratio. (see image in original document)
...SPECIFICATION 21. Line 440 is the graphical representation of the
compression ratio for the CAM multi-dictionary system and line 442
illustrates the compression ratio for a standard LZW algorithm. Lines 440
and 442 plot compression ratio (original file size/compressed file
size) as a function of b, the maximum number of bits in the output
codes (i.e. log2 of the dictionary size).

To emphasize the advantage of the CAM/standby dictionary method, a
dashed line 444...



22/5,K/21 (Item 21 from file: 348)
DIALOG(R) File 348: EUROPEAN PATENTS
(c) 2005 European Patent Office. All rts. reserv.

00544075
Dictionary reset performance enhancement for data compression applications
Erhohung der Leistungsfahigkeit beim Rucksetzten von Worterbuchern fur
Datenkompressionsanwendungen
Augmentation de performance lors de l'initialisation d'un dictionnaire pour
applications en compression de donnees
PATENT ASSIGNEE:
Hewlett-Packard Company, (206030), 3000 Hanover Street, Palo Alto,
California 94304, (US), (applicant designated states: DE; FR; GB; IT)
INVENTOR:
Lantz, Carl B., 619 N.W. 15th Street, Corvallis, OR 97330, (US)
LEGAL REPRESENTATIVE:
Colgan, Stephen James et al (29461), CARPMAELS & RANSFORD 43 Bloomsbury
Square, London WC1A 2RA, (GB)
PATENT (CC, No, Kind, Date): EP 534713 A2 930331 (Basic)
EP 534713 A3 931110
EP 534713 B1 980318
APPLICATION (CC, No, Date): EP 92308614 920922;
PRIORITY (CC, No, Date): US 766475 910925
DESIGNATED STATES: DE; FR; GB; IT
INTERNATIONAL PATENT CLASS: H03M-007/30;
CITED PATENTS (EP A): WO 8912363 A; WO 8912363 A; US 4053948 A

ABSTRACT EP 534713 A2 

A class of lossless data compression algorithms use a RAM-based 
dictionary to facilitate the compression and decompression of data. The 
dictionary is initialized or reset at various events, such as data file 
boundaries, or at specific compression ratio thresholds. The average time 
to perform the dictionary reset is decreased by using a dictionary reset 
optimizer (12) to reset the dictionary without writing the DICT VALID 
fields (19) of all dictionary locations (1, 2, 3, ... L) immediately upon each
reset. The DICT VALID field in static RAM (16) is set up as an N-bit
field and the optimizer includes a counter (20) for generating an N-bit
DICT NUMBER field (22) under control of a controller (24) responsive to
reset signals. During operation to compress/decompress data after a
reset, the DICT VALID field of a dictionary location is compared to the
current DICT NUMBER (1). If their values are equal, the DICT ENTRY field
(18) in such location is deemed valid. Otherwise, the DICT ENTRY field is
rewritten with new compression data and the DICT VALID field is reset
equal to the current DICT NUMBER (1). Upon each reset, the counter is
incremented to provide a new DICT NUMBER (2) whose value will no longer
match current DICT VALID values stored in the dictionary locations. (see
image in original document)
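
The reset scheme above can be modelled in software with a short sketch. The class and method names are hypothetical. The handling of counter wrap-around (a full clear once the N-bit counter overflows) is an assumption, since the abstract does not say what happens at that point.

```python
class LazyResetDictionary:
    """Dictionary whose reset costs O(1) except when the counter wraps."""

    def __init__(self, num_locations: int, tag_bits: int = 4):
        self.entries = [None] * num_locations   # DICT ENTRY fields
        self.valid_tags = [0] * num_locations   # DICT VALID fields (N bits)
        self.tag_bits = tag_bits
        self.dict_number = 1                    # current DICT NUMBER

    def reset(self) -> None:
        """Invalidate every location by moving to a new DICT NUMBER."""
        self.dict_number += 1
        if self.dict_number == (1 << self.tag_bits):
            # Assumption: when the N-bit counter wraps, clear all tags once
            # so that stale tags cannot match a reused DICT NUMBER.
            self.valid_tags = [0] * len(self.valid_tags)
            self.dict_number = 1

    def lookup(self, location: int):
        """Return the entry only if its tag equals the current DICT NUMBER."""
        if self.valid_tags[location] == self.dict_number:
            return self.entries[location]
        return None

    def store(self, location: int, entry) -> None:
        """Rewrite the entry and mark it valid for the current DICT NUMBER."""
        self.entries[location] = entry
        self.valid_tags[location] = self.dict_number
```

With N tag bits, the expensive full clear happens only once every 2^N - 1 resets in this sketch, which is where the reduction in average reset time comes from.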
...SPECIFICATION the matching function. In order to have an efficient
hashing algorithm, the number of valid dictionary locations in the
static RAM is much larger than the maximum number of valid
dictionary entries. A factor of two to four is typical.

The data compression engine 7 provides the address for the
dictionary location (1, 2, 3, ... L) to be accessed, as well as the read
and write...
...SPECIFICATION selected by the processor.

The above described embodiments employ a method of deletion which is
highly efficient in terms of memory utilization and execution speed
although it is slightly sub-optimal in terms of compression
efficiency. The embodiments are formed from the two data structures used
to build a dictionary, the systematic search tree structure used to
represent strings and the systematic tabular representation of...
English Abstract

The present invention provides a novel method and system for obtaining
maximum system speed performance for compressing coded data. The novel
pointer address used comprises a string code portion representative of
the last matched string found in the dictionary and appended thereto an
extension character code portion representative of the next character
taken from the input data stream. The dictionary is constructed large
enough to provide a unique memory location for every possible pointer
address. The unique pointer address, representative of a string of data
in the input data stream, is employed as an address pointer to access the
memory. If no data is in memory at the address pointer the string has not
been previously observed and a new unique compression code is stored at
the pointer address in memory for the string so that the next identical
pointer address will read the code for the string from the dictionary.
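
A minimal sketch of the directly addressed dictionary described in this abstract follows. It assumes 8-bit input characters and an LZW-style code assignment starting at 256. The function name and the `code_bits` parameter are hypothetical, and the reset code mentioned elsewhere in the record is left out.

```python
def compress_direct(data: bytes, code_bits: int = 12) -> list[int]:
    """LZW-style compression using a directly addressed dictionary.

    The pointer address is the current string code with the next
    character appended: (string_code << 8) | character.
    """
    max_codes = 1 << code_bits
    # One memory location for every possible pointer address.
    memory = [None] * (max_codes << 8)
    next_code = 256           # codes 0..255 stand for single characters
    out: list[int] = []
    if not data:
        return out
    string_code = data[0]
    for ch in data[1:]:
        address = (string_code << 8) | ch
        stored = memory[address]
        if stored is not None:
            # String seen before: its code is read straight from memory.
            string_code = stored
        else:
            # New string: emit the last match, store a new code here.
            out.append(string_code)
            if next_code < max_codes:
                memory[address] = next_code
                next_code += 1
            string_code = ch
    out.append(string_code)
    return out
```

Each input character costs one memory access with no hashing or searching. The price is a table of 2^(code_bits + 8) locations, most of which stay empty.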

French Abstract 

Cette invention concerne un procede et un systeme permettant de maximiser
la vitesse de compression de donnees codees. Selon cette invention,
l'adresse de pointeur utilisee comprend une partie code de chaine
representative de la derniere chaine correspondante localisee dans le
dictionnaire et accrochee a une partie rallonge de code de caractere
representative du caractere suivant pris dans le train de donnees
d'entree. De par sa conception, ce dictionnaire est suffisamment
important pour qu'un emplacement de memoire unique soit disponible pour
chaque adresse de pointeur possible. Cette adresse de pointeur unique,
qui est representative d'une chaine de donnees dans le flux de donnees
d'entree, sert d'adresse de pointeur pour acceder a la memoire. Si aucune
donnee ne se trouve dans la memoire au niveau du pointeur d'adresse,
c'est que la chaine n'a pas ete observee precedemment. Une nouvelle
compression unique codee est stockee dans la memoire au niveau de
l'adresse de pointeur pour cette chaine de sorte que la prochaine adresse
de pointeur identique lira le code dans le dictionnaire pour ladite
chaine.

Legal Status (Type, Date, Text) 
Examination 20010719 Request for preliminary examination prior to end of
19th month from priority date

Fulltext Availability:
Detailed Description

Detailed Description
... 65,000 of the possible 2^24 addresses are filled, it is now possible to
configure a dictionary of the type shown in Figure 8 to
optimize the number of code compression bits between 9 and
16. Once the dictionary is filled with a maximum number of
codes allowed by the number of code bits selected the data
compression system transmits a reset code to the decoder or
receiver
English Abstract

The system creates an alphabetically ordered main dictionary listing all
unique words appearing in the text file. A text file (16) "word" is
defined as a sequence of characters ending with one or more "word
terminators" such as spaces, commas, periods and carriage returns. The
compression system also creates a common word dictionary (22) referencing
words most often encountered in the text file. The sequence of words
forming the text file is represented by a word index, a list of one byte
and two byte references to common and main dictionary words,
respectively. The system compresses the main dictionary (26) using three
complementary techniques. First, leading characters of each
dictionary word matching leading characters of a next preceding
dictionary word are represented by data indicating the number of matching
characters. Second, commonly encountered dictionary word suffixes are
represented by data referencing entries of a small suffix dictionary
(30). Third, remaining characters of main dictionary words are
represented by bytes encoded to represent commonly encountered characters
and groups of characters.

French Abstract 

L'invention concerne un systeme servant a comprimer un fichier texte
ASCII ou un fichier texte code de facon similaire. Le systeme permet de
creer un dictionnaire principal classe par ordre alphabetique
repertoriant tous les mots particuliers apparaissant dans le fichier
texte. Un fichier texte "mot" est defini comme etant une sequence de
caracteres se terminant par une ou plusieurs "terminaisons de mot", tels
des espaces, des virgules, des points et des retours a la ligne. Le
systeme de compression permet aussi de creer un dictionnaire de mots
courants renvoyant a des mots apparaissant le plus frequemment dans le
fichier texte. La sequence de mots formant le fichier texte est
representee par un index de mots, une liste de renvois d'un ou de deux
octets a des mots du dictionnaire principal ou du dictionnaire de mots
courants, respectivement. Le systeme comprime le dictionnaire principal
au moyen de trois techniques complementaires. Premierement, des
caracteres de tete de chaque mot de dictionnaire correspondant a des
caracteres de tete d'un mot suivant du dictionnaire precedent sont
representes par des donnees indiquant le nombre de caracteres
correspondants. Deuxiemement, des suffixes de mots de dictionnaire
apparaissant regulierement sont representes par des donnees renvoyant a
des entrees d'un petit dictionnaire de suffixes. Troisiemement, des
caracteres restants de mots de dictionnaire principal sont representes
par des octets codes pour representer des caracteres et des groupes de
caracteres apparaissant regulierement. Le systeme comprime egalement des
structures de donnees de style souvent incluses dans des fichiers texte
de traitements de texte.
Detailed Description

Detailed Description

the uncompressed dictionary the system creates a
corresponding, usually much smaller, entry of the compressed
dictionary. The system employs three compression techniques,
described below, which cooperate to produce a relatively high
dictionary compression ratio.

Leading Character Compression

As mentioned above, the main dictionary entries are
entered in alphabetical order to maximize the number of
matching leading characters. The first dictionary
compression technique makes use of the fact that since the
main dictionary is alphabetized, the first 1 to 15 characters
of each word entry are likely to...
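
Leading character compression of an alphabetized word list is commonly known as front coding, and a minimal sketch of it is given below. The `(count, remainder)` tuple layout and the function names are assumptions. The cap of 15 matching characters follows the "1 to 15 characters" mentioned above, but the excerpt does not describe the record's actual byte encoding.

```python
def front_encode(sorted_words, max_prefix=15):
    """Replace each word's shared leading characters with a count."""
    encoded, previous = [], ""
    for word in sorted_words:
        limit = min(len(previous), len(word), max_prefix)
        n = 0
        while n < limit and previous[n] == word[n]:
            n += 1
        # Store how many leading characters match the preceding word,
        # followed by the characters that differ.
        encoded.append((n, word[n:]))
        previous = word
    return encoded

def front_decode(encoded):
    """Rebuild the word list from (match count, remainder) pairs."""
    words, previous = [], ""
    for n, remainder in encoded:
        word = previous[:n] + remainder
        words.append(word)
        previous = word
    return words
```

For example, the alphabetized list `compress, compressed, compression, computer` encodes as `(0, "compress"), (8, "ed"), (8, "ion"), (4, "uter")`.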
Claims
English Abstract

Data compression and decompression utilising e.g. the Ziv-Lempel
algorithm is simplified by utilising a tree structure for the dictionary
in which alternative symbols at a given position in a symbol sequence (a,
b, c) are linked by linking pointers R of a first type and successive
symbols (ab, bc, ca, aba, abb, abc) are linked by linking pointers D of a
second type. For example the sequence ab may continue with any one of the
symbols a, b and c grouped together by R pointers in a list below the
final symbol of the sequence ab. Each symbol is defined by an associated
pair of D and R pointers, in conjunction with a parent pointer P which
identifies its parent. Symbols having no D pointers extending therefrom
are pruned from the tree and transferred to a free list as shown in
Figure 8(b).
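
The D, R and P pointers described in this abstract correspond to a first-child/next-sibling tree with parent links. The sketch below is illustrative only: the class and function names are hypothetical, and the policy for choosing which leaf to prune is not given in the abstract, so the caller picks the leaf.

```python
class TreeNode:
    """One symbol in the dictionary tree."""

    def __init__(self, symbol, parent=None):
        self.symbol = symbol
        self.P = parent   # parent pointer
        self.D = None     # down pointer: first symbol that can follow
        self.R = None     # right pointer: next alternative at this position

def find_child(node, symbol):
    """Follow D once, then R pointers, to find a continuation symbol."""
    child = node.D
    while child is not None and child.symbol != symbol:
        child = child.R
    return child

def insert(root, sequence):
    """Add a symbol sequence to the tree and return its final node."""
    node = root
    for symbol in sequence:
        child = find_child(node, symbol)
        if child is None:
            child = TreeNode(symbol, parent=node)
            child.R = node.D      # link into the list of alternatives
            node.D = child
        node = child
    return node

def prune_leaf(leaf, free_list):
    """Unlink a node that has no D pointer and move it to the free list."""
    assert leaf.D is None, "only leaves may be pruned"
    parent = leaf.P
    if parent.D is leaf:
        parent.D = leaf.R
    else:
        sibling = parent.D
        while sibling.R is not leaf:
            sibling = sibling.R
        sibling.R = leaf.R
    leaf.P = leaf.R = None
    free_list.append(leaf)
```

The parent pointer is what lets a leaf be unlinked without searching the tree from the root.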

French Abstract 

La compression et la decompression de donnees au moyen par exemple de
l'algorithme de Ziv-Lempel est simplifiee par l'emploi, pour le
dictionnaire, d'une structure arborescente dans laquelle des variantes de
symbole, a un emplacement donne dans une sequence de symboles (a, b, c),
sont lies par des indicateurs de liaison R d'un premier type, et des
symboles successifs (ab, bc, ca, aba, abb, abc) sont lies par des
indicateurs de liaison D d'un second type. La sequence ab peut par
exemple se poursuivre avec n'importe lequel des symboles a, b et c
regroupes par des indicateurs R dans une liste au-dessous du symbole
final de la sequence ab. Chaque symbole est defini par une paire associee
d'indicateurs D et R, conjointement avec un indicateur pere P qui
identifie sa parente. Les symboles d'ou ne partent pas d'indicateurs D
sont elagues de la structure arborescente et transferes sur une liste
libre comme le montre la figure 8(b).

Fulltext Availability:
Detailed Description

Detailed Description

highly efficient in terms of memory
utilization and execution speed although it is slightly
sub-optimal in terms of compression efficiency. The
embodiments are formed from the two data structures used
to build a dictionary, the systematic search tree
structure used to represent strings and the systematic
tabular representation of...
Data compression and data decompression scheme using a search tree, in
which each entry is stored with a character string of infinite length

Datenkomprimierungs- und Datendekomprimierungsschema unter Verwendung eines
Suchbaums, bei dem jeder Eintrag mit einer Zeichenkette unendlicher
Lange gespeichert ist
Abstract (English machine translation)

Data compression apparatus, comprising: a data memory (104, 204) for
storing input data; a code memory (105, 205) for storing codes obtained
by compressing the input data; a dictionary (100) having a plurality of
entries, each of which can store a first pointer (301) that indicates an
arbitrarily long character string of the input data stored in the data
memory, the dictionary being expressed by a search tree that comprises a
main node (302, 305) for storing the first pointer (301) and a branch
node (303, 306) for representing a branch in the search tree; a
longest-match string search unit (101) for searching, among the character
strings indicated by the respective first pointers (301), for a
longest-match character string that matches a character string to be
coded over a longest length, the longest length being the longest match
length; a coding unit (102) for coding the longest match length and an
index of the entry that stores the first pointer (301) indicating the
longest-match character string, and for writing the resulting code into
the code memory (105, 205); and a dictionary updating unit (103) for
adding to the dictionary an entry that stores the first pointer (301)
indicating the coded character string, wherein the dictionary (100)
permits entries to be registered without limitation on their lengths by
storing the first pointers (301) in the respective entries, whereby a
character string of arbitrary length stored in the data memory (104, 204)
can be found and coded by designating an index of an entry and the
longest match length.

Abstract (German) 

Datenkomprimierungsvorrichtung, umfassend: einen Datenspeicher (104, 204)
zum Speichern von Eingabedaten, einen Codespeicher (105, 205) zum
Speichern von durch die Komprimierung von Eingabedaten erhaltenen Codes,
ein Lexikon (100) mit einer Vielzahl von Eintragen, in welchen jeweils
ein erster Zeiger (301) gespeichert werden kann, der eine beliebig lange
Zeichenkette der in dem Datenspeicher gespeicherten Eingabedaten anzeigt,
wobei das Lexikon durch einen Suchbaum ausgedruckt ist, der einen
Hauptknoten (302, 305) zum Speichern des ersten Zeigers (301) und einen
Abzweigungsknoten (303, 306) zum Darstellen einer Abzweigung bei dem
Suchbaum umfasst, eine Sucheinrichtung (101) fur die Zeichenkette mit der
langsten Ubereinstimmung zum Suchen einer Zeichenkette mit der langsten
Ubereinstimmung, die mit einer uber eine langste Lange zu codierenden
Zeichenkette ubereinstimmt, unter den durch die jeweiligen ersten Zeiger
(301) angegebenen Zeichenketten, wobei die langste Lange die langste
Ubereinstimmungslange ist, eine Codiereinrichtung (102) zum Codieren der
langsten Ubereinstimmungslange und eines Indexes eines Eintrags, der
einen ersten Zeiger (301) speichert, der die Zeichenkette mit der
langsten Ubereinstimmung angibt, und zum Schreiben eines sich ergebenden
Codes in den Codespeicher (105, 205), und eine
Lexikon-Aktualisierungseinrichtung (103) zum Hinzufugen eines Eintrags zu
dem Lexikon, der den die codierte Zeichenkette angebenden ersten Zeiger
(301) speichert, wobei das Lexikon (100) die Registrierung der Eintrage
ohne Beschrankung von deren Langen durch Speichern der ersten Zeiger
(301) in den jeweiligen Eintragen gestattet, wodurch eine in dem
Datenspeicher (104, 204) gespeicherte Zeichenkette mit einer
willkurlichen Lange gefunden und codiert werden kann, indem ein Index
eines Eintrags und die langste Ubereinstimmungslange bezeichnet werden.



Fulltext Availability: 

Description (English machine translation) 

Description (English machine translation) 

... the nodes of a search tree (i.e. the number of the entries of the
dictionary 100) should be restricted in the manner described above.

The maximum number of the entries is determined by a compromise
between the storage costs and the compression ratio. That is, if
the maximum number of the entries is increased, the compression
ratio is improved on the basis of an increase of the number of the
character...

...1000, it is judged whether the number of he/it reached 200 registered 
entries a maximum number of the entries in the conversion table, that 
in same way as for this in the compression proceeding would use 
lexicon is put in. 



If a judgment result of step 1000 boasts, that the conversion table. 
is used in conjunction with maintaining a separate hash table for each
packet. Further, the per-packet hash table indexes particular byte
strings in the packet but does not index data in any other packet(s).
...SPECIFICATION embodiment also maintain singly linked offset chains
terminated by an invalid entry, and recycle hash tables by changing the
valid offset range for the hash table (not by clearing the entire
table). Further, as will be appreciated, a compression level parameter,
e.g., 8 strings, is utilized to determine the maximum number of
strings that are checked while searching for a match.

Advantageously, in accordance with an aspect of the invention,
increased robustness and greater compression ratios ... As will be
appreciated, the compression level parameter is an indicator for
determining the overall computation level used during compression, the...
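
The offset chains, the recycling by valid offset range, and the compression level parameter can be sketched together as follows. This is a rough illustration under assumptions rather than the embodiment itself: a single table is reused across packets appended to one buffer, strings are hashed three bytes at a time, and the hash function and all names are hypothetical.

```python
HASH_BITS = 12  # assumed table size of 4096 heads

def hash3(buf, pos):
    """Hash the 3 bytes starting at pos into a HASH_BITS-wide index."""
    mixed = (buf[pos] << 8) ^ (buf[pos + 1] << 4) ^ buf[pos + 2]
    return mixed & ((1 << HASH_BITS) - 1)

class OffsetChainTable:
    def __init__(self, max_chain=8):
        self.buf = bytearray()
        self.head = [-1] * (1 << HASH_BITS)  # newest offset per hash value
        self.prev = []         # prev[i]: older offset with the same hash
        self.valid_from = 0    # offsets below this are treated as invalid
        self.max_chain = max_chain  # compression level, e.g. 8 strings

    def start_packet(self, packet):
        """Recycle the table by moving the valid offset range, not clearing."""
        self.valid_from = len(self.buf)
        self.buf += packet
        self.prev += [-1] * len(packet)

    def insert(self, pos):
        """Index the string at pos; needs at least 3 bytes from pos."""
        h = hash3(self.buf, pos)
        self.prev[pos] = self.head[h]
        self.head[h] = pos

    def longest_match(self, pos):
        """Walk the chain until an invalid entry or max_chain candidates.

        Call this before insert(pos) so pos is not its own candidate.
        """
        best_off, best_len = -1, 0
        cand = self.head[hash3(self.buf, pos)]
        checked = 0
        while cand >= self.valid_from and checked < self.max_chain:
            n = 0
            while (pos + n < len(self.buf)
                   and self.buf[cand + n] == self.buf[pos + n]):
                n += 1
            if n > best_len:
                best_off, best_len = cand, n
            cand = self.prev[cand]
            checked += 1
        return best_off, best_len
```

Stale heads and links left over from an earlier packet fall below `valid_from`, so they end the chain in the same way as the initial `-1` entries, and no per-packet clearing pass is needed.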
packet. Further, the per-packet hash table indexes particular byte
strings in the packet but does not index data in any other packet(s).
That is, a respective separate hash table for each packet is employed
wherein such hash table is constructed as the particular packet is
compressed. As such, the respective hash table is encoded with the
particular packet. Employing a per-packet hash table in combination with
variable history state inter-packet compression provides for efficient
and robust overall compression of the packets.

...CLAIMS The method of claim 10 further comprising
identifying a particular one of byte substrings by searching each
respective hash table of the packets, the searching being performed
as a function of the respective packet history state of the packet.

12. The method of claim 11 wherein a maximum number of searches
made during the searching of the respective hash tables is
determined using a compression level parameter.

13. The method of claim 12 wherein the compression level parameter is
defined...

...series of byte substrings such that particular ones of the byte
substrings are identified by searching the respective hash tables
of ...packet history state associated with the packet.

21. The apparatus of claim 20 wherein a maximum number of searches
performed during the searching is established using a compression
level parameter.

22. The apparatus of claim 21 wherein the communications channel is part
of...

...CLAIMS series of byte substrings such that particular ones of the byte
substrings are identified by searching the respective hash tables
of the packets, the searching being performed as a function of the
respective packet history state associated with the packet.

14. The apparatus of claim 13 wherein a maximum number of searches
performed during the searching is established using a compression
level parameter.

15. The apparatus of claim 14 wherein the communications channel is part
of...
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English Abstract 

A method for compressing a routing table involves constructing a binary 
tree representation of the routing table. The compression method makes 
three passes through the tree. In a first pass, the compression method 
propagates routing information down to the tree leaves. During this pass, 
the program assigns every leaf node in the tree an associated next hop or 
an inherited next hop from a higher level ancestral node. In a second 
pass, the compression method migrates prevalent next hops up the tree. 
This bottom up pass involves forming a set of next hops at a parent node 
by supernetting the sets of next hops A and B for a pair of child nodes 
corresponding to the parent node, according to the following operation: 
$A*B = A \cup B$ if $A \cap B = \emptyset$, and 
$A*B = A \cap B$ if $A \cap B \neq \emptyset$, where A*B 
is a set of next hops formed at the parent node. In the third pass, the 
compression method eliminates redundant branches in the tree. This top 
down pass begins at a parent node and selects a next hop from a parent 
node. The method then examines a child node branching from the parent 
node to determine whether the selected next hop is an element of next 
hops for the child node. If it is, the method eliminates the next hops 
for the child node. After the tree is restructured by the three-pass 
process, the compression method converts it back to a new routing table. 

French Abstract 

Cette invention concerne un procede de compression d'une table de 
routage qui consiste a construire une representation en arbre binaire de 
la table de routage. Le procede de compression consiste a effectuer trois 
passages a travers l'arbre. Lors du premier passage, le procede de 
compression permet de propager des informations de routage vers le bas en 
direction des feuilles de l'arbre. Lors de ce passage, le programme 
attribue a chaque noeud de feuille de l'arbre un saut suivant associe ou 
un saut suivant herite d'un noeud ancestral d'un niveau plus eleve. Lors 
du deuxieme passage, le procede de compression permet de faire migrer les 
sauts suivants prevalants vers le haut de l'arbre. Ce passage du bas vers 
le haut consiste a former un ensemble de sauts suivants au niveau d'un 
noeud parent en surreseautant les ensembles de sauts suivants A et B pour 
deux noeuds de descendance correspondant au noeud parent, ceci selon 
l'operation $A*B = A \cup B$ si $A \cap B = \emptyset$ et 
$A*B = A \cap B$ si $A \cap B \neq \emptyset$, etant 
entendu que A*B represente un ensemble de sauts suivants forme au niveau 
du noeud parent. Lors du troisieme passage, le procede de compression 
permet d'eliminer les branches redondantes de l'arbre. Ce passage du haut 
vers le bas debute au niveau d'un noeud parent et permet de choisir un 
saut suivant depuis un noeud parent. Le procede consiste ensuite a 
examiner un embranchement de noeud de descendance depuis le noeud parent 
afin de determiner si le saut suivant choisi est un element des sauts 
suivants pour le noeud de descendance. Si tel est le cas, le procede 
permet d'eliminer les sauts suivants pour le noeud de descendance. Une 
fois l'arbre restructure par le traitement en trois passages, le procede 
de compression consiste a le transformer a nouveau en une nouvelle table 
de routage. 

Fulltext Availability: 
Detailed Description 

Detailed Description 

... root and the less prevalent next hops down toward the leaves, while 
concurrently pruning the maximum number of routes from the tree. 
The compression process, given a routing table that provides forwarding 
information for IP addresses using longest prefix match, provides a 
routing table which (a) provides the same forwarding information and 
(b) has the least possible number of... 
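
The three passes described in this record can be sketched on a small
binary trie as follows. This is an illustrative reading of the abstract,
not the patent's implementation: prefixes are written as bit strings, the
names Node, build, pass1, pass2, pass3, compress and lookup are
hypothetical, and the "drop" default hop for uncovered addresses is an
assumption.

```python
class Node:
    def __init__(self):
        self.child = [None, None]
        self.hop = None    # next hop configured at this prefix, if any
        self.hops = set()  # candidate next hops computed in pass 2


def build(table):
    root = Node()
    for prefix, hop in table.items():
        n = root
        for bit in prefix:
            i = int(bit)
            if n.child[i] is None:
                n.child[i] = Node()
            n = n.child[i]
        n.hop = hop
    return root


def pass1(n, inherited):
    """Top down: push routing information to the leaves."""
    hop = n.hop if n.hop is not None else inherited
    if n.child[0] is None and n.child[1] is None:
        n.hop = hop
        return
    for i in (0, 1):
        if n.child[i] is None:
            n.child[i] = Node()  # give every internal node two children
        pass1(n.child[i], hop)
    n.hop = None


def pass2(n):
    """Bottom up: A*B is the intersection if non-empty, else the union."""
    if n.child[0] is None:
        n.hops = {n.hop}
        return
    pass2(n.child[0])
    pass2(n.child[1])
    a, b = n.child[0].hops, n.child[1].hops
    n.hops = (a & b) or (a | b)


def pass3(n, inherited, prefix, out):
    """Top down: emit a route only where the inherited hop does not fit."""
    if inherited in n.hops:
        chosen = inherited
    else:
        chosen = min(n.hops)  # any element works; min() keeps it repeatable
        out[prefix] = chosen
    if n.child[0] is not None:
        pass3(n.child[0], chosen, prefix + "0", out)
        pass3(n.child[1], chosen, prefix + "1", out)


def compress(table, default="drop"):
    root = build(table)
    pass1(root, default)  # assumption: uncovered addresses map to `default`
    pass2(root)
    out = {}
    pass3(root, None, "", out)
    return out


def lookup(table, addr):
    """Longest prefix match over a {prefix: hop} table."""
    best = max((p for p in table if addr.startswith(p)), key=len, default=None)
    return None if best is None else table[best]
```

For the four-route table {"": "h1", "00": "h2", "10": "h2", "11": "h3"}
the sketch yields three routes, {"": "h2", "01": "h1", "11": "h3"}, which
forward every address identically.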
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Abstract (English machine translation) 

A dictionary (100) has a large number of entries, each for storing a 
pointer that indicates the head of a character string of input data 
stored in a data memory (104, 204). Among the character strings indicated 
by the respective pointers, a search is made for the character string with 
the longest match, that is, the one that matches a character string to be 
coded over the longest length (the longest match length). The longest 
match length and the index of the entry storing the pointer that indicates 
the head of the longest-match character string are coded, and the 
resulting code is written into a code memory (105, 205). An entry storing 
the pointer that indicates the head of the coded character string is added 
to the dictionary (100) in order to update it. 
Abstract (German) 

Ein Lexikon (100) weist eine Vielzahl von Eintragen jeweils zum Speichern 
eines Zeigers auf, der den Kopf einer in einem Datenspeicher (104, 204) 
gespeicherten Zeichenkette von Eingabedaten angibt. Unter den durch die 
jeweiligen Zeiger angegebenen Zeichenketten wird nach einer Zeichenkette 
mit der langsten Ubereinstimmung, die mit einer uber eine langste Lange 
(d. h. einer langsten Ubereinstimmungslange) zu codierenden Zeichenkette 
ubereinstimmt, gesucht. Die langste Ubereinstimmungslange und ein Index 
eines Eintrags, der einen Zeiger speichert, der den Kopf der Zeichenkette 
mit der langsten Ubereinstimmung angibt, werden codiert, und ein sich 
ergebender Code wird in einen Codespeicher (105, 205) geschrieben. Ein 
Eintrag, der den Zeiger speichert, der den Kopf der codierten 
Zeichenkette angibt, wird dem Lexikon (100) hinzugefugt, um es zu 
aktualisieren. 

Main International Patent Class: H03M-007/30 
International Patent Class: H03M-007/42 ... 
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Fulltext Availability: 

Description (English machine translation) 
Description (English machine translation) 

compressed code remains, the next procedure begins. 

Fig. 10 shows details of the procedure performed by the conversion table 
update unit 203. In step 1000 it is judged whether the number of entries 
registered in the conversion table 200 has reached a maximum number of 
entries, which is set in the same way as for the dictionary used in the 
compression procedure. 

If the evaluation result of step 1000 indicates that the conversion 
table 200 still has room to register a new entry, the procedure goes 
to. . . 
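
The scheme in this record (a dictionary of pointers into the data memory,
a longest-match search, a code made of entry index and match length, and
a table that stops growing at a maximum number of entries) can be
sketched as below. Several details are assumptions, since the excerpt does
not give them: the literal code for unmatched bytes, the limits
MAX_ENTRIES and MAX_MATCH, and the choice to simply stop adding entries
once the table is full.

```python
MAX_ENTRIES = 256  # assumed capacity of dictionary / conversion table
MAX_MATCH = 16     # assumed longest codable match length


def match_len(data, a, b, limit):
    """Length of the common run of data[a:] and data[b:], capped at limit."""
    n = 0
    while n < limit and b + n < len(data) and data[a + n] == data[b + n]:
        n += 1
    return n


def encode(data: bytes):
    ptrs = []  # dictionary: entry index -> pointer to the head of a string
    out = []
    pos = 0
    while pos < len(data):
        best_i, best_n = -1, 0
        for i, p in enumerate(ptrs):
            n = match_len(data, p, pos, MAX_MATCH)
            if n > best_n:
                best_i, best_n = i, n
        if best_n == 0:
            out.append(("lit", data[pos]))  # assumption: literal fallback
            step = 1
        else:
            out.append(("ref", best_i, best_n))
            step = best_n
        if len(ptrs) < MAX_ENTRIES:  # register only while there is room
            ptrs.append(pos)
        pos += step
    return out


def decode(codes):
    buf = bytearray()
    ptrs = []  # conversion table, rebuilt in step with the encoder
    for code in codes:
        start = len(buf)
        if code[0] == "lit":
            buf.append(code[1])
        else:
            _, i, n = code
            p = ptrs[i]
            for k in range(n):
                buf.append(buf[p + k])  # byte-wise copy allows overlap
        if len(ptrs) < MAX_ENTRIES:
            ptrs.append(start)
    return bytes(buf)
```

Because the decoder applies the same capacity test before registering an
entry, both tables stay identical without any dictionary being
transmitted.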



File 347:JAPIO Nov 1976-2005/Feb (Updated 050606) 
         (c) 2005 JPO & JAPIO 
File 350:Derwent WPIX 1963-2005/UD,UM &UP=200535 
         (c) 2005 Thomson Derwent 
File 344:Chinese Patents Abs Aug 1985-2005/May 
         (c) 2005 European Patent Office 
File 371:French Patents 1961-2002/BOPI 200209 
         (c) 2002 INPI. All rts. reserv. 

Set        Items  Description 
S1        648635  MAXIMUM OR MAXIMIS? OR MAXIMIZ? OR GREATEST OR LARGEST OR 
                  HIGHEST OR OPTIMAL? OR OPTIMUM? OR OPTIMIS? OR OPTIMIZ? 
S2         17478  S1(2W)(ENTRY? OR ENTRIES OR POPULAT? OR CONTENT? ? OR 
                  NUMBER? ? OR QUANTITY? OR QUANTITIES OR NUMERIC?? ?) 
S3          2867  S1(2W)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR TERMS OR 
                  TEMINOLOGY OR UNIT OR UNITS OR ITEM OR ITEMS) 
S4           210  NGRAM? ? OR BIGRAM? ? OR (BI OR N)()GRAM? ? 
S5       1221720  COMPRESS? OR COMPACT? OR PACK? ? OR PACKING OR PACKED 
S6       3272904  LENGTH? OR SIZE? ? OR DIMENSION? ? OR VOLUME? ? OR 
                  PROPORTION? ? OR SPACE OR SPATIAL 
S7        290411  S6(2N)(MINIMI? OR SHORT?? ? OR SHORTEN? OR CONDENS??? ? OR 
                  CONDENSAT? OR REDUC????? ? OR RED? ? OR DIMIN? OR DECREAS? 
                  OR DECREM? OR ABBREVIAT?) 
S8         11288  S6(2N)(KEY? ? OR STRING? ? OR WORD? ? OR CHARACTERSTRING? 
                  OR SUBSTRING? OR LEXEME? OR MORPHEME? OR TERM OR TERMS) 
S9         25054  LEXIC??? ? OR VOCABULAR? OR DICTIONAR? 
S10       800909  CACHE? ? OR CONTAINER? OR RECEPTACLE? OR DATABASE? OR 
                  DATASET? OR DATABANK? OR DATASTORE? OR DATAFILE? OR 
                  DATACOLLECTION? 
S11            3  DATALIBRAR? OR DATADEPOSIT? OR DATAREPOSIT? OR 
                  DATAWAREHOUS? 
S12       162847  DATA()(BASE? ? OR SET? ? OR BANK? ? OR STORE? ? OR 
                  STOREHOUSE? OR FILE? ? OR COLLECTION? ? OR LIBRAR? OR 
                  DEPOSIT? OR REPOSIT? OR WAREHOUS? OR WARE()HOUSE?) 
S13         2327  S10:S12(3N)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR 
                  TERMS) 
S14       583767  TABLE OR TABLES OR TABULAR? OR ARRAY? ? OR HASHTABLE? 
S15      2341123  SEARCH? OR RETRIEV? OR FETCH? OR MATCH? OR COMPAR??? ? OR 
                  COMPARISON? OR COMPARAT? OR EXTRACT? OR QUERY? OR QUERIE? ? 
S16         1369  S2:S3 AND (S4:S5 OR S7) 
S17           22  S16 AND S8 
S18           12  S16 AND S9 
S19            1  S16 AND S13 
S20          110  S16 AND S14 
S21           27  S20 AND S15 
S22        76720  IC='G06F-017/30':IC='G06F-017/39' 
S23        27291  IC='H03M-007/30':IC='H03M-007/48' 
S24          976  MC=W01-A02A 
S25         9087  MC=U21-A05A2 
S26           12  S20 AND S22:S25 
S27           68  S17:S19 OR S21 OR S26 
S28           68  IDPAT (sorted in duplicate/non-duplicate order) 
S29           68  IDPAT (primary/non-duplicate records only) 

? t29/9/20 



29/9/20 (Item 20 from file: 350) 

DIALOG (R) File 350:Derwent WPIX 

(c) 2005 Thomson Derwent. All rts. reserv. 

012448055 **Image available** 
WPI Acc No: 1999-254163/199921 
XRPX Acc No: N99-189246 

Data compression apparatus for textual database management system 

Patent Assignee: UNISYS CORP (BURS) 
Inventor: COOPER A B; MAIMONE W D 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 
Patent No   Kind Date     Applicat No  Kind Date     Week 
US 5893102  A    19990406 US 96761252  A    19961206 199921 B 

Priority Applications (No Type Date): US 96761252 A 19961206 
Patent Details: 
Patent No   Kind Lan Pg Main IPC     Filing Notes 
US 5893102  A        25 G06F-017/30 

Abstract (Basic) : US 5893102 A 

NOVELTY - A linked tree structure, consisting of a list of word phrase 
references for each word, is stored in a dictionary database. Each 
input word is compared with the stored word list and the maximum 
correspondence is judged. The judged word is added to the list and 
the words are converted to compressed phrase codes by an LZW procedure. 

DETAILED DESCRIPTION - An extended phrase is obtained by adding 
each word judged to have maximum correspondence, and is stored. A 
compression phrase code is assigned to each phrase, and compression 
is carried out on that basis. Each node of the linked tree structure 
represents a word, with root and child nodes. A parent pointer and a 
child pointer are provided in each node to represent the correspondence 
between words. An INDEPENDENT CLAIM is included for a data compression 
method. 

USE - For a textual database management system in a telephone company. 
ADVANTAGE - Query search time is reduced because the phrases are 
formed in the search tree before actual compression starts. 
Dictionary-based compression ensures effective compression of the 
textual data and thereby reduces storage space. Retrieval speed of the 
textual data is increased because the phrases in the dictionary are 
used. 

DESCRIPTION OF DRAWING(S) - The figure shows a block diagram of the 
textual database system with compression apparatus. 
pp; 25 DwgNo 1A, 1B/9 
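
The record above describes LZW applied to whole words, so that growing
word phrases receive phrase codes. A minimal sketch of word-level LZW
follows. It is an illustration only: it uses a flat Python dict where the
record describes a linked tree, it assumes whitespace tokenisation, and it
assumes the vocabulary of single words is collected in a first pass and
shared with the decoder. The function names are hypothetical.

```python
def lzw_words(text: str):
    """Encode text as phrase codes; returns (codes, vocabulary)."""
    words = text.split()
    vocab = {}
    for w in words:  # first pass: one code per distinct word
        vocab.setdefault((w,), len(vocab))
    codes = dict(vocab)  # phrase (tuple of words) -> phrase code
    out = []
    phrase = ()
    for w in words:
        cand = phrase + (w,)
        if cand in codes:
            phrase = cand  # keep extending the longest known phrase
        else:
            out.append(codes[phrase])
            codes[cand] = len(codes)  # register the extended phrase
            phrase = (w,)
    if phrase:
        out.append(codes[phrase])
    return out, [w for (w,) in vocab]


def unlzw_words(out, vocab):
    """Inverse of lzw_words; whitespace is normalised to single spaces."""
    if not out:
        return ""
    table = [(w,) for w in vocab]
    prev = table[out[0]]
    result = list(prev)
    for code in out[1:]:
        if code < len(table):
            entry = table[code]
        else:  # code not yet in the table: phrase + its own first word
            entry = prev + (prev[0],)
        result.extend(entry)
        table.append(prev + (entry[0],))
        prev = entry
    return " ".join(result)
```

For "to be or not to be or not to be" the ten input words are encoded as
seven phrase codes, because "to be" and "or not" are emitted as single
codes once they have been registered.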
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Abstract (Basic) : EP 678986 A 

The data compression system generates working data streams from an 
original data stream. Two sequential working data strings are 
combined to form a combined string. A dictionary is generated by 
registering, with a dictionary number, the combined strings whose 
occurrence frequency is higher than a given value. 

The combined strings in the data stream are replaced with the 
dictionary numbers corresponding to the combined strings in the 
dictionary. The data stream of the digital input signal is stored in 
store (1), and data compression is performed by registering strings 
appearing in the data stream in a dictionary in second store (2). 

ADVANTAGE - Data transmission period and decompression period can 
be considerably shortened so as to permit high speed image display in 
case of image data. 

Dwg.1/13 

Abstract (Equivalent) : US 5604495 A 

A data compression method, in which a data stream of a digital 
input signal is stored in first storage means and data compression of 
the data stream is performed with registering strings appearing in said 
data stream in a dictionary of second storage means, comprising: 

performing first process step for designating R representing a 
repetition number of the following third to fifth process steps wherein 
R is an integer smaller than or equal to D, with taking a total data 
capacity of the original data stream being S multiply N bits, wherein S 
is an integer greater than or equal to 2 and N is an integer greater 
than or equal to 3, and a maximum dictionary register number D 
where D is 2^S; 

performing second process step for separating the original data 
stream of S bit length stored in said storage means and generating a 
working data stream by adding a non-conducting identifier bit for 
conversion into a working data stream having (S+1) bit working strings; 

performing third process step for detecting combined string 
consisted of two sequential working strings in said working data stream 
up to D/Rth order in descending order from working string having 
largest occurrence frequency, and having occurrence frequency greater 
than or equal to 3; 

performing fourth process step for registering compression 
dictionary data of (S+1) bits consisted of dictionary number and 
compression identifier bit, in said second storage means, 
corresponding to each of combined strings detected by said third 
process step; and 

fifth process step for replacing combined string among combined 
strings in said working data stream matching with one of combined 
strings registered in said second storage means, with the compression 
dictionary data corresponding to the matching combined string, 

repeating said third to fifth process steps for R times with taking 
data stream replaced through said fifth process step as working data 
stream for outputting data stream stored in said first storage means 
and all combined string and compression dictionary data stored in 
said second storage means, after R times repetition, as compressed 
data. 
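
The claim above repeats, R times, a round that finds frequent pairs of
adjacent working strings, registers them under dictionary numbers and
replaces them in the stream. A simplified sketch of that loop is given
below. It is not the claimed method: symbols are modelled as
(flag, value) tuples where flag 0 marks a raw S-bit string and flag 1 a
dictionary number, pair counts include overlapping occurrences, and the
function names and default parameters are assumptions.

```python
from collections import Counter


def compress(symbols, S=8, R=4, min_freq=3):
    """Return (working stream, dictionary) after up to R replacement rounds."""
    D = 2 ** S          # maximum number of dictionary registrations
    per_round = D // R  # at most D/R new registrations in each round
    work = [(0, s) for s in symbols]
    dictionary = []     # dictionary number -> (left item, right item)
    for _ in range(R):
        counts = Counter(zip(work, work[1:]))
        picked = [p for p, c in counts.most_common(per_round) if c >= min_freq]
        if not picked:
            break
        number = {p: len(dictionary) + i for i, p in enumerate(picked)}
        dictionary.extend(picked)
        out, i = [], 0
        while i < len(work):  # left-to-right, non-overlapping replacement
            pair = tuple(work[i:i + 2])
            if len(pair) == 2 and pair in number:
                out.append((1, number[pair]))
                i += 2
            else:
                out.append(work[i])
                i += 1
        work = out
    return work, dictionary


def expand(item, dictionary):
    flag, value = item
    if flag == 0:
        return [value]
    left, right = dictionary[value]
    return expand(left, dictionary) + expand(right, dictionary)


def decompress(work, dictionary):
    return [s for item in work for s in expand(item, dictionary)]
```

Decompression needs both outputs, which matches the claim's statement
that the replaced data stream and the dictionary data are output together
as the compressed data.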
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ABSTRACT 



PROBLEM TO BE SOLVED: To provide a data compression unit that can quickly 
retrieve an address of a dictionary memory on which new information is 
going to be overwritten and where the power consumption of the dictionary 
memory can be reduced. 

SOLUTION: A dictionary memory 4B to build up a dictionary used for data 
compression consists of four memories 41-4 3 that are operated 
independently of each other. When a CPU 2 discriminates that the number of 
character data to be stored in, e.g. the memory 42 reaches a maximum 
registration number, the CPU 2 activates only the memory 42 that stores 
slave node addresses to overwrite new character data to the memory 42. 
Thus, a retrieval circuit 6 sequentially retrieves addresses to which no 
slave node address is written from a head address of the memory 42 that is 
in operation and starts 2nd and succeeding address retrievals from an 
address next to the address retrieved at the 1st retrieval. 

COPYRIGHT: (C) 2000, JPO 
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ABSTRACT 

PURPOSE: To provide a data compressing device and its method which can 
quickly grow a dictionary and can improve the data compressing effect 
by registering plural partial data strings in a single dictionary 
registration processing when the partial input data strings are 
sequentially registered in the dictionary and then the input data are 
compressed and coded by referring to the dictionary. 

CONSTITUTION: A data compressing device 102 consists of a longest string 
coincidence retrieving part 104 which retrieves the longest one of symbol 
strings registered in a dictionary 105 that is coincident with a partial 
data string of an input data string 101, a coding part 106 which codes the 
index of the longest coincident symbol string, and a dictionary register 
part 107 which registers a symbol that connects the symbols including the 
1st symbol of the coded longest coincident symbol string through the 
MaxEnt-th symbol decided by the largest register number set value 108 
to the longest coincident symbol string coded just before the relevant time 
point into the dictionary 105. 
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ABSTRACT 

PURPOSE: To provide a data compressing device which can keep the high 
data compressing efficiency and also can effectively use a memory. 
CONSTITUTION: A data compressing table 10 is continuously retrieved 
for each character or bit until the match is secured between the input 
data and the retrieving data in the maximum length. Then the compressed 
code word corresponding to the match retrieving data of the maximum 
length is transmitted in place of an input data train, and the retrieving 
data on the contents of the unmatched input data length and its 
corresponding compressed code word are generated on the table 10. When 
the compressed code words shown in the least variable bit number exceed 
the maximum number that can be shown in a bit, the table 10 is 
updated. Thus a fixed data compression table is not required since the 
compressed code word is generated by the table 10. As a result, the 
high compressing efficiency is kept even with various text data. 
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03420228 **Image available** 
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03-083128 [JP 3083128 A] 
April 09, 1991 (19910409) 
OKAMURA KAZUYUKI 
NISHI TATSUMI 

NEC CORP [000423] (A Japanese Company or Corporation), JP 
(Japan) 

01-221014 [JP 89221014] 
August 28, 1989 (19890828) 
[5] G06F-009/06 

45.1 (INFORMATION PROCESSING -- Arithmetic Sequence Units) 
Section: P, Section No. 1221, Vol. 15, No. 255, Pg. 80, June 
27, 1991 (19910627). 



ABSTRACT 

PURPOSE: To shorten the average processing time required for acquisition of 
a record whose reference is requested by first regularly storing page 
information in a report file information table with a page as the unit 
unconditionally and repeatedly reconstituting the information table when 
the quantity of page information reaches a maximum number of entries. 

CONSTITUTION: A page information storage means 12 successively stores page 
information including the page number, the line number, and the record 
address in a report file page information table 14 at the time of reading 
in records from a report file 2. When the quantity of page information 
stored in the report file page information table 14 by the page 
information storage means 12 exceeds the maximum number of entries of 
the report file page information table, a page information compressing 
means 13 extends page intervals to compress page information. Thus, the 
report file page information table 14 is dynamically reconstituted, and a 
page information retrieving means 15 retrieves page information in the 
report file page information table 14 to return the record address most 
suitable for the retrieval condition, and the record is quickly acquired. 
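
The abstract above keeps one table entry per page until the table
exceeds its maximum number of entries, then widens the page interval to
shrink the table. A minimal sketch of that behaviour follows. The class
and method names, the doubling of the interval and the (page, address)
entry layout are assumptions made for illustration; the abstract also
stores a line number, which is omitted here.

```python
import bisect


class PageIndex:
    """Bounded index of (page number, record address) entries."""

    def __init__(self, max_entries=4):
        self.max_entries = max_entries
        self.interval = 1   # store every `interval`-th page, starting at 1
        self.entries = []   # kept in ascending page order

    def add(self, page, address):
        if (page - 1) % self.interval != 0:
            return  # page falls between stored intervals
        self.entries.append((page, address))
        if len(self.entries) > self.max_entries:
            # Assumed policy: double the interval and drop entries that
            # no longer fall on an interval boundary.
            self.interval *= 2
            self.entries = [e for e in self.entries
                            if (e[0] - 1) % self.interval == 0]

    def nearest(self, page):
        """Entry with the largest page number <= page, or None."""
        i = bisect.bisect_right(self.entries, (page, float("inf"))) - 1
        return self.entries[i] if i >= 0 else None
```

After pages 1 to 10 are added with max_entries=4, the interval has grown
to 4 and only pages 1, 5 and 9 remain; a request for page 7 then starts
reading from the record address stored for page 5.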
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Set Items Description 

S1       5111145  MAXIMUM OR MAXIMIS? OR MAXIMIZ? OR GREATEST OR LARGEST OR 
                  HIGHEST OR OPTIMAL? OR OPTIMUM? OR OPTIMIS? OR OPTIMIZ? 
S2         97914  S1(2W)(ENTRY? OR ENTRIES OR POPULAT? OR CONTENT? ? OR 
                  NUMBER? ? OR QUANTITY? OR QUANTITIES OR NUMERIC?? ?) 
S3         17679  S1(2W)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR TERMS OR 
                  TEMINOLOGY OR UNIT OR UNITS OR ITEM OR ITEMS) 
S4          3045  NGRAM? ? OR BIGRAM? ? OR (BI OR N)()GRAM? ? 
S5       1665778  COMPRESS? OR COMPACT? OR PACK? ? OR PACKING OR PACKED 
S6       8617252  LENGTH? OR SIZE? ? OR DIMENSION? ? OR VOLUME? ? OR 
                  PROPORTION? ? OR SPACE OR SPATIAL 
S7        338242  S6(2N)(MINIMI? OR SHORT?? ? OR SHORTEN? OR CONDENS??? ? OR 
                  CONDENSAT? OR REDUC????? ? OR RED? ? OR DIMIN? OR DECREAS? 
                  OR DECREM? OR ABBREVIAT?) 
S8         61385  S6(2N)(KEY? ? OR STRING? ? OR WORD? ? OR CHARACTERSTRING? 
                  OR SUBSTRING? OR LEXEME? OR MORPHEME? OR TERM OR TERMS) 
S9        103246  LEXIC??? ? OR VOCABULAR? OR DICTIONAR? 
S10       942548  CACHE? ? OR CONTAINER? OR RECEPTACLE? OR DATABASE? OR 
                  DATASET? OR DATABANK? OR DATASTORE? OR DATAFILE? OR 
                  DATACOLLECTION? 
S11          198  DATALIBRAR? OR DATADEPOSIT? OR DATAREPOSIT? OR 
                  DATAWAREHOUS? 
S12       517758  DATA()(BASE? ? OR SET? ? OR BANK? ? OR STORE? ? OR 
                  STOREHOUSE? OR FILE? ? OR COLLECTION? ? OR LIBRAR? OR 
                  DEPOSIT? OR REPOSIT? OR WAREHOUS? OR WARE()HOUSE?) 
S13         8064  S10:S12(3N)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR 
                  TERMS) 
S14      1366471  TABLE OR TABLES OR TABULAR? OR ARRAY? ? OR HASHTABLE? 
S15      9697019  SEARCH? OR RETRIEV? OR FETCH? OR MATCH? OR COMPAR??? ? OR 
                  COMPARISON? OR COMPARAT? OR EXTRACT? OR QUERY? OR QUERIE? ? 
S16         6406  S2:S3 AND (S4:S5 OR S7) 
S17           93  S16 AND S8 
S18           39  S16 AND S9 




S19 
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S16 AND S13 




S20          258  S16 AND S14 
S21           28  S17 AND (S20 OR S15) 
S22           85  S20 AND S15 
S23          152  S18:S19 OR S21:S22 
S24           59  S23/2000:2005 
S25           93  S23 NOT S24 
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S27 


              40  S17/2000:2005 
S28           32  S17 NOT (S27 OR S23) 




S29         9 25  RD (unique items) 
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Abstract: Parallel algorithms for lossless data . compression via 
dictionary compression using optimal, longest fragment first (LFF), and 
greedy parsing strategies are described. Dictionary compression removes 
redundancy by replacing substrings of the input by references to strings 
stored in a dictionary. Given a static dictionary stored as a suffix 
tree, we present a CREW PRAM algorithm for optimal compression which runs 
in O(M + log M log n) time with O(nM^2) processors, where it is assumed 
that M is the maximum length of any dictionary entry. Under the same 
model, we give an algorithm for LFF compression which runs in O(log^2 n) 
time with O(n/log n) processors where it is assumed that the maximum 
dictionary entry is of length O(log n). We also describe an O(M + log n) 
time and O(n) processor algorithm for greedy parsing given a static or 
sliding-window dictionary. For sliding-window compression, a different 
approach finds the greedy parsing in O(log n) time using O(nM log M/log n) 
processors. Our algorithms are practical in the sense that their analysis 
elicits small constants. (16 Refs) 
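
The difference between greedy and optimal parsing against a static
dictionary can be shown with a short sequential sketch. It does not
reproduce the parallel PRAM algorithms of the abstract: it only
illustrates the two parsing strategies, with the dictionary held as a
plain Python set (an assumption, in place of the suffix tree) and M as
the maximum entry length. The function names are hypothetical.

```python
def greedy_parse(text, dictionary, M):
    """At each position take the longest dictionary entry that matches."""
    out, i = [], 0
    while i < len(text):
        for n in range(min(M, len(text) - i), 0, -1):
            if text[i:i + n] in dictionary:
                out.append(text[i:i + n])
                i += n
                break
        else:
            raise ValueError("no dictionary entry matches at position %d" % i)
    return out


def optimal_parse(text, dictionary, M):
    """Fewest phrases overall, by dynamic programming over prefixes."""
    n = len(text)
    INF = float("inf")
    cost = [0] + [INF] * n  # cost[j]: fewest phrases covering text[:j]
    back = [0] * (n + 1)    # back[j]: length of the last phrase used
    for j in range(1, n + 1):
        for k in range(1, min(M, j) + 1):
            if text[j - k:j] in dictionary and cost[j - k] + 1 < cost[j]:
                cost[j] = cost[j - k] + 1
                back[j] = k
    if cost[n] == INF:
        raise ValueError("text cannot be parsed with this dictionary")
    out, j = [], n
    while j > 0:
        out.append(text[j - back[j]:j])
        j -= back[j]
    return out[::-1]
```

With the dictionary {"a", "b", "ab", "bbbb"} and the text "abbbb", greedy
parsing commits to "ab" and needs four phrases, while optimal parsing
uses "a" followed by "bbbb" and needs two.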
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51 7 614327 MAXIMUM OR MAXIMIS? OR MAXIMIZ? OR GREATEST OR LARGEST OR 

                  HIGHEST OR OPTIMAL? OR OPTIMUM? OR OPTIMIS? OR OPTIMIZ? 
S2        157248  S1(2W)(ENTRY? OR ENTRIES OR POPULAT? OR CONTENT? ? OR 
                  NUMBER? ? OR QUANTITY? OR QUANTITIES OR NUMERIC?? ?) 
S3         55564  S1(2W)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR TERMS OR 
                  TEMINOLOGY OR UNIT OR UNITS OR ITEM OR ITEMS) 
S4           161  NGRAM? ? OR BIGRAM? ? OR (BI OR N)()GRAM? ? 
S5       1365271  COMPRESS? OR COMPACT? OR PACK? ? OR PACKING OR PACKED 
S6       7081340  LENGTH? OR SIZE? ? OR DIMENSION? ? OR VOLUME? ? OR 
                  PROPORTION? ? OR SPACE OR SPATIAL 
S7        206052  S6(2N)(MINIMI? OR SHORT?? ? OR SHORTEN? OR CONDENS??? ? OR 
                  CONDENSAT? OR REDUC????? ? OR RED? ? OR DIMIN? OR DECREAS? 
                  OR DECREM? OR ABBREVIAT?) 
S8         95502  S6(2N)(KEY? ? OR STRING? ? OR WORD? ? OR CHARACTERSTRING? 
                  OR SUBSTRING? OR LEXEME? OR MORPHEME? OR TERM OR TERMS) 
S9        162351  LEXIC??? ? OR VOCABULAR? OR DICTIONAR? 
S10      1315086  CACHE? ? OR CONTAINER? OR RECEPTACLE? OR DATABASE? OR 
                  DATASET? OR DATABANK? OR DATASTORE? OR DATAFILE? OR 
                  DATACOLLECTION? 
S11         1368  DATALIBRAR? OR DATADEPOSIT? OR DATAREPOSIT? OR 
                  DATAWAREHOUS? 
S12       327259  DATA()(BASE? ? OR SET? ? OR BANK? ? OR STORE? ? OR 
                  STOREHOUSE? OR FILE? ? OR COLLECTION? ? OR LIBRAR? OR 
                  DEPOSIT? OR REPOSIT? OR WAREHOUS? OR WARE()HOUSE?) 
S13        14425  S10:S12(3N)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR 
                  TERMS) 
S14      1973047  TABLE OR TABLES OR TABULAR? OR ARRAY? ? OR HASHTABLE? 
S15      8969467  SEARCH? OR RETRIEV? OR FETCH? OR MATCH? OR COMPAR??? ? OR 
                  COMPARISON? OR COMPARAT? OR EXTRACT? OR QUERY? OR QUERIE? ? 
S16         1779  S2:S3(S)(S4:S5 OR S7) 
S17           35  S16(S)S8 
S18           11  S16(S)S9 
S19            2  S16(S)S13 
S20          131  S16(S)S14 
S21           77  S20(S)S15 
S22          116  S17:S19 OR S21 
S23           85  S22/2000:2005 
S24           31  S22 NOT S23 
S25           28 
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01485274 01-36262 
Compression theory 

McGillis, Peggy; Nichols, Mina; Terry, Britt 
Computer Technology Review PP: 60-61+ Summer 1997 
ISSN: 0278-9647 JRNL CODE: CTN 
WORD COUNT: 2498 

...TEXT: symbol. The longer the match, the better the compression ratio. 

An advantage to using a dictionary-based method is that dictionary 
entries may be of various lengths. For instance, an incrementing pattern 
00h to FFh may require only one entry into the dictionary. Patterns 
consisting of continuous repeating data, such as all FFh or all 00h, will 
compress very efficiently assuming the maximum dictionary word 
length is sufficiently large to describe the repeating portion. The 
importance of this technology in today. . . 

. . .many tablespaces of database files contain repeating text which can 
easily be included in the dictionary. 

This method of data compression focuses primarily on the encoding 
dictionary. Simple coding methods generally... 
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51 8447158 MAXIMUM OR MAXIMIS? OR MAXIMIZ? OR GREATEST OR LARGEST OR 

                  HIGHEST OR OPTIMAL? OR OPTIMUM? OR OPTIMIS? OR OPTIMIZ? 
S2        130099  S1(2W)(ENTRY? OR ENTRIES OR POPULAT? OR CONTENT? ? OR 
                  NUMBER? ? OR QUANTITY? OR QUANTITIES OR NUMERIC?? ?) 
S3         66683  S1(2W)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR TERMS OR 
                  TEMINOLOGY OR UNIT OR UNITS OR ITEM OR ITEMS) 
S4           111  NGRAM? ? OR BIGRAM? ? OR (BI OR N)()GRAM? ? 
S5       1784071  COMPRESS? OR COMPACT? OR PACK? ? OR PACKING OR PACKED 
S6       7693568  LENGTH? OR SIZE? ? OR DIMENSION? ? OR VOLUME? ? OR 
                  PROPORTION? ? OR SPACE OR SPATIAL 
S7        258068  S6(2N)(MINIMI? OR SHORT?? ? OR SHORTEN? OR CONDENS??? ? OR 
                  CONDENSAT? OR REDUC????? ? OR RED? ? OR DIMIN? OR DECREAS? 
                  OR DECREM? OR ABBREVIAT?) 
S8         96888  S6(2N)(KEY? ? OR STRING? ? OR WORD? ? OR CHARACTERSTRING? 
                  OR SUBSTRING? OR LEXEME? OR MORPHEME? OR TERM OR TERMS) 
S9        141592  LEXIC??? ? OR VOCABULAR? OR DICTIONAR? 
S10      2648497  CACHE? ? OR CONTAINER? OR RECEPTACLE? OR DATABASE? OR 
                  DATASET? OR DATABANK? OR DATASTORE? OR DATAFILE? OR 
                  DATACOLLECTION? 
S11         2371  DATALIBRAR? OR DATADEPOSIT? OR DATAREPOSIT? OR 
                  DATAWAREHOUS? 
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POSIT? OR WAREHOUS? OR WARE () HOUSE? ) 

S13        30257  S10:S12(3N)(WORD? ? OR LEXEME? OR MORPHEME? OR TERM OR TER 
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COMPARISON? OR COMPARAT? OR EXTRACT? OR QUERY? OR QUERIE? ? 

S16         2188  S2:S3(S)(S4:S5 OR S7) 
S17           40  S16(S)S8 
S18            2  S16(S)S9 
S19            1  S16(S)S13 
S20           86  S16(S)S14 
S21           21  S20(S)S15 
S22           62  S17:S19 OR S21 
S23           23  S22/2000:2005 
S24           39  S22 NOT S23 
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526 25 S2072000:2005 

S27           43  S20 NOT (S26 OR S22) 
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Set Items Description 

S1             3  AU=PECHENY B? 

? t1/ti/all 

1/TI/1 (Item 1 from file: 350) 

DIALOG (R) File 350: (c) 2005 Thomson Derwent. All rts. reserv. 

Desulphurising heavy petroleum residues - by heating, applying voltage 
and adding barium hydroxide and one or both of ferric oxide and red lead 
in defined ratios 

1/TI/2 (Item 2 from file: 350) 

DIALOG (R) File 350: (c) 2005 Thomson Derwent. All rts. reserv. 

Desulphurisation of heavy oil residues - by heating and passing current 
in the presence of oxide(s) of iron and/or lead and barium hydroxide 
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METHOD OF DESULPHURATION OF HEAVY OIL RESIDUES 
PROCEDE DE DESULFURATION DE RESIDUS D'HUILE LOURDE 



